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PCT 



RAW SEQUENCE LISTING DATE: 02/17/2005 

PATENT APPLICATION: US/10/522,341 TIME: 12:04:43 

Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.raw 



C- 

C- 



2 <110> APPLICANT: Kock, Michael 

3 Frank , Markus 

4 Badur, Ralf 

6 <120> TITLE OF INVENTION: Novel Selection Processes 

8 <130> FILE REFERENCE: 532622010400 

10 <140> CURRENT APPLICATION NUMBER: US/10/522,341 

11 <141> CURRENT FILING DATE: 2005-01-25 

13 <160> NUMBER OF SEQ ID NOS : 71 

15 <170> SOFTWARE: Patentln Ver. 2.1 



ERRORED SEQUENCES 



W--> 



9)6 




<210> SEQ ID NO: 4 
<211> LENGTH: 42 7 
<212> TYPE: PRT 

J^-jORGANXSM : Art 
<220> FEATURE 

2- 3 > OTHER -— TffiFQR] 




Doss -Not Comply 
Corrected Diskette Needed , 



icia^ sequence I 

ON: Description of the artificial sequence: coding for 
cytosine deaminase (codA) 



357 


<4 00> SEQUENCE : 


4 
























358 


Met 


Ser 


Asn 


Asn 


Ala 


Leu 


Gin 


Thr 


He 


He 


Asn 


Ala 


Arg 


Leu 


Pro 


Gly 


359 


1 








5 










10 










15 




361 


Glu 


Glu 


Gly 


Leu 


Trp 


Gin 


He 


His 


Leu 


Gin 


Asp Gly 


Lys 


He 


Ser 


Ala 


362 








20 










25 










30 






364 


He 


Asp 


Ala 


Gin 


Ser 


Gly 


Val 


Met 


Pro 


He 


Thr 


Glu 


Asn 


Ser 


Leu 


Asp 


365 






35 










40 










45 








367 


Ala 


Glu 


Gin 


Gly 


Leu 


Val 


He 


Pro 


Pro 


Phe 


Val 


Glu 


Pro 


His 


He 


His 


368 




50 










55 










60 










370 


Leu 


Asp 


Thr 


Thr 


Gin 


Thr 


Ala 


Gly 


Gin 


Pro 


Asn 


Trp 


Asn 


Gin 


Ser 


Gly. 


371 


65 










70 










75 










80 


373 


Thr 


Leu 


Phe 


Glu 


Gly 


He 


Glu 


Arg 


Trp 


Ala 


Glu 


Arg 


Lys 


Ala 


Leu 


Leu 


374 










85 










90 










95 




376 


Thr 


His 


Asp 


Asp 


Val 


Lys 


Gin 


Arg 


Ala 


Trp 


Gin 


Thr 


Leu 


Lys 


Trp 


Gin 


377 








100 










105 










110 






379 


He 


Ala 


Asn 


G\ Y 


He 


Gin 


His 


Val 


Arg 


Thr 


His 


Val 


Asp 


Val 


Ser 


Asp 


380 






115 




















125 








382 


Ala 


Thr 


Leu 


Thr 


Ala 


Leu 


Lys 


Ala 


Met 


Leu 


Glu 


Val 


Lys 


Gin 


Glu. Val 


383 




130 










135 










140 










385 


Ala 


Pro 


Trp 


He 


Asp 


Leu 


Gin 


He 


Val 


Ala 


Phe 


Pro 


Gin 


Glu 


Gly 


He 


386 


145 










150 










155 










160 


388 


Leu 


Ser 


Tyr 


Pro 


Asn 


Gly 


Glu 


Ala 


Leu 


Leu 


Glu 


Glu 


Ala 


Leu 


Arg Leu 


389 










165 










170 










175 






file://C:\CRF4\Outhold\VsrJ52234 1 .htm 
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RAW S£Q9SNCE LISTING 

PATENT APPLICATION: US/10/522,341 



^^^: 

TIME: 



02/17/2005 
12:04:43 



W--> 



Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.raw 

391 Gly Ala Asp Val Val Gly Ala He Pro His Phe Glu Phe Thr Arg Glu 

392 180 185 190 

394 Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gin Lys Tyr 

395 195 200 205 

397 Asp Arg Leu He Asp Val His Cys Asp Glu He Asp Asp Glu Gin Ser 

398 210 215 220 

400 Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly 

401 225 230 235 240 

403 Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly 

404 245 250 255 

406 Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly He Asn 

407 260 265 270 

409 Phe Val Ala Asn Pro Leu Val Asn He His Leu Gin Gly Arg Phe Asp 

410 275 280 285 

412 Thr Tyr Pro Lys Arg Arg Gly lie Thr Arg Val Lys Glu Met Leu Glu 

413 290 295 300 

415 Ser Gly He Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp 

416 305 310 315 320 

418 Tyr Pro Leu Gly Thr Ala Asn Met Leu Gin Val Leu His Met Gly Leu 

419 325 330 335 

421 His Val Cys Gin Leu Met Gly Tyr Gly Gin He Asn Asp Gly Leu Asn 

422 340 345 350 

424 Leu He Thr His His Ser Ala Arg Thr Leu Asn Leu Gin Asp Tyr Gly 

425 355 360 365 

427 He Ala Ala Gly Asn Ser Ala Asn Leu He He Leu Pro Ala Glu Asn 

428 370 375 380 

43 0 Gly Phe Asp Ala Leu Arg Arg Gin Val Pro Val Arg Tyr Ser Val Arg 
431 385 390 395 400 

433 Gly Gly Lys Val He Ala Ser Thr Gin Pro Ala Gin Thr Thr Val Tyr 

434 405 410 415 
436 Leu Glu Gin Pro Glu. Ala He Asp Tyr Lys Arg 
437 
4798 
4799 
4800 
4801 
4802 
4803. 
4804 
4806 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



ID Np* 50 

^H: (2 7\ <sT) 

C: DN&r^ \ 



Artificial sequence 



420 

SEQ lw x.*£J> -\ ~r w S 

LENGTH: (2 7^ \s^<K$\ 

TYPE 

ORGANISM 
FEATURE: 

OTHER INFORMATION: Description of the artificial sequence: 
oligonucleotide primer 
<400> SEQUENCE: 50 
4807 ^g^gaatacg gcgtggagto g r 
4810 <210> SEQ ID NOr-61 ' 1 J- 

LENGTH : Re ) \/\ 

TYPE: DNA^ 

ORGANISM: Artificial sequence 
FEATURE : 

OTHER INFORMATION: Description of the artificial sequence: 
oligonucleotide primer 
<400> SEQUENCE: 51 




W--> 



4811 
4812 
4813 
4814 
4815 
4816 
4818 



<211> 
<212> 
<213> 
<220> 
<223> 




6> 



file://C:\CRF4\Outhold\VsrJ522341.htm 
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RAW SEIKnCE LISTING DSK: 02/17/2005 

PATENT APPLICATION: US/10/522,341 TIME: 12:04:44 

Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.raw 



E--> 4819 cggcaggata atcaggttgg 



4843 <211> LENGTH: 127 

4844 <212> TYPE: Dl 

4845 <213> ORGANISM: Artificial sequence 
W--> 4846 <220> FEATURE: 

4847 <223> OTHER INFORMATION: Description of the artificial sequence: 

4848 oligonucleotide primer 
4850 <400> SEQUENCE: 53 

E--> 4851 ^tcaacgtaa ccaaocctgc 20 

5506 <210> SEQ ID NO: 59 

5507 <211> LENGTH: 1011 

5508 <212> TYPE: DNA 

5509 <213> ORGANISM: Zea mays 
W--> 5510 <220> FEATURE: 

5511 <221> NAME /KEY : CDS 

5512 <222> LOCATION: (1) . . (981) 

5513 <223> OTHER INFORMATION: coding for 5 -methyl thioribose kinase 
5515 <400> SEQUENCE: 59 




5516 


gca 


cga 


gca 


etc 


etc 


tec 


tct 


cct 


etc 


gcc 


ggc 


gca 


teg 


ccc 


gac 


tgt 


48 


5517 


Ala 


Arg 


Ala 


Leu 


Leu 


Ser 


Ser 


Pro 


Leu 


Ala 


Gly 


Ala 


Ser 


Pro 


Asp 


Cys 




5518 


1 








5 










10 










15 






5520 


cag 


tea 


gcc 


tea 


gcc 


atg 


gcc 


gcg 


gag 


gag 


gag 


cag 


ggc 


ttc 


cgc 


ccg 


96 


5521 


Gin 


Ser 


Ala 


Ser 


Ala 


Met 


Ala 


Ala 


Glu 


Glu 


Glu 


Gin 


Gly 


Phe 


Arg 


Pro 




5522 








20 










25 










30 








5524 


ctg 


gac 


gag 


teg 


tec 


ctg 


etc 


gcc 


tac 


ate 


aag 


gcc 


acg 


ccg 


gcg 


etc 


144 


5525 


Leu 


Asp 


Glu 


Ser 


Ser 


Leu 


Leu 


Ala 


Tyr 


He 


Lys 


Ala 


Thr 


Pro 


Ala 


Leu 




5526 






35 










40 










45 










5528 


gcc 


tec 


cgc 


etc 


ggc 


ggc 


ggt 


ggc 


agt 


eta 


gac 


tec 


ate 


gag 


ate 


aag 


192 


5529 


Ala 


Ser 


Arg 


Leu 


Gly 


Gly 


Gly 


Gly 


Ser 


Leu 


Asp 


Ser 


He 


Glu 


He 


Lys 




5530 




50 










55 










60 












5532 


gag 


gtc 


ggc 


gac 


ggc 


aac 


etc 


aac 


ttc 


gtc 


tac 


ate 


gtg 


cag 


tec 


gag 


240 


5533 


Glu 


Val 


Gly 


Asp 


Gly 


Asn 


Leu 


Asn 


Phe 


Val 


Tyr 


He 


Val 


Gin 


Ser 


Glu 




5534 


65 










70 










75 










80 




5536 


gcc 


ggc 


gcc 


ate 


gtc 


gtc 


aag 


cag 


gcg 


etc 


ccg 


tac 


gtg 


cgc 


tgc 


gtg 


288 


5537 


Ala 


Gly 


Ala 


He 


Val 


Val 


Lys 


Gin 


Ala 


Leu 


Pro 


Tyr 


Val 


Arg 


Cys 


Val 




5538 










85 










90 










95 






5540 


ggg 


gat 


teg 


tgg 


ccc 


atg 


acg 


egg 


gag 


cgc 


gcc 


tac 


ttc 


gag 


gcc 


tec 


336 


5541 


Gly 


Asp 


Ser 


Trp 


Pro 


Met 


Thr 


Arg Glu Arg 


Ala 


Tyr 


Phe 


Glu 


Ala 


Ser 




5542 








100 










105 










110 








5544 


acg 


ctg 


egg 


gag 


cac 


ggc 


cgc 


ctg 


tgc 


ccg 


gag 


cac 


ace 


ccc 


gag 


gtg 


384 


5545 


Thr 


Leu 


Arg 


Glu 


His 


Gly 


Arg 


Leu 


Cys 


Pro 


Glu 


His 


Thr 


Pro 


Glu 


Val 




5546 






115 










120 










125 










5548 


tac 


cac 


ttc 


gac 


egg 


ace 


ttg 


teg 


ctg atg 


ggg 


atg 


cgc 


tac 


ate 


gag 


432 


5549 


Tyr 


His 


Phe 


Asp 


Arg 


Thr 


Leu 


Ser 


Leu 


Met 


Gly 


Met 


Arg 


Tyr 


He 


Glu 




5550 




130 










135 










140 












5552 


ccc 


ccg 


cac 


ate 


ate 


etc 


cgc 


aag 


ggc 


etc 


gtc 


gcc 


ggt 


gtc 


gag 


tac 


480 


5553 


Pro 


Pro 


His 


He 


He 


Leu 


Arg 


Lys 


Gly Leu 


Val 


Ala 


Gly 


Val 


Glu 


Tyr 




5554 


145 










150 










155 










160 
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RAW SE^ENCE LISTING dW: 02/17/2005 

PATENT APPLICATION: US/10/522,341 TIME: 12:04:44 

Input Set : A:\sequence listing, txt 
Output Set: N:\CRF4\02172005\J522341.raw 



5556 


ccg ctg 


etc 


gee 


gac 


cac 


atg 


tec 


gat 


tac 


atg 


gee 


aag 


acg 


etc 


ttc 


528 


5557 


Pro 


Leu 


Leu 


Ala 


Asp 


His 


Met 


Ser 


Asp 


Tyr 


Met 


Ala 


Lys 


Thr 


Leu 


Phe 




5558 










165 










170 










175 






5560 


ttc 


acc 


tec 


etc 


etc 


tat 


aac 


aat 


acc 


acg 


gat 


cat 


aag 


aac 


gga gtt 


576 


5561 


Phe 


Thr 


Ser 


Leu 


Leu 


Tyr Asn 


Asn 


Thr 


Thr 


Asp 


His 


Lys 


Asn 


Gly Val 




5562 








180 










185 










190 








5564 


get 


aag 


tac 


tct 


gcg 


aac 


gtg 


gag 


atg 


tgt 


agg 


etc 


acg 


gag 


caa 


gtt 


624 


5565 


Ala 


Lys 


Tyr 


Ser 


Ala 


Asn 


Val 


Glu 


Met 


Cys 


Arg 


Leu 


Thr 


Glu 


Gin 


Val 




5566 






195 










200 










205 










5568 


gtg 


ttc 


teg 


gac 


cca 


tac 


cgt 


gtt 


tec 


aaa 


ttt 


aat 


egg 


tgg 


acc 


teg 


672 


5569 


Val 


Phe 


Ser 


Asp 


Pro 


Tyr Arg 


Val 


Ser 


Lys 


Phe 


Asn 


Arg 


Trp 


Thr 


Ser 




5570 




210 










215 










220 












5572 


cct 


tat 


etc 


gac 


aaa 


gat 


get 


gag 


gca 


gtt 


cgc 


gag 


gat 


gat 


gag 


etc 


720 


5573 


Pro 


Tyr 


Leu 


Asp 


Lys 


Asp Ala 


Glu 


Ala 


Val 


Arg 


Glu 


Asp 


Asp 


Glu 


Leu 




5574 


225 










230 










235 










240 




5576 


aag 


ttg 


gaa 


gta 


get 


ggg 


ctg 


aaa 


teg 


atg 


ttt 


ate 


gag 


aga 


get 


caa 


768 


5577 


Lys 


Leu 


Glu 


Val 


Ala 


Gly Leu 


Lys 


Ser 


Met 


Phe 


He 


Glu 


Arg 


Ala 


Gin 




5578 










245 










250 










255 






5580 


get 


ctg 


att 


cat 


gga 


gat 


etc 


cac 


act 


ggt 


tct 


ate 


atg 


gtg 


acc 


gaa 


816 


5581 


Ala 


Leu 


He 


His 


Gly 


Asp 


Leu 


His 


Thr 


Gly 


Ser 


He 


Met 


Val 


Thr 


Glu 




5582 








260 










265 










270 








5584 


gtt 


caa 


etc 


aag 


tea 


ttg 


ate 


cag 


aat 


ttg 


ggt 


tct 


atg 


ggg 


cca 


atg 


864 


5585 


Val 


Gin 


Leu 


Lys 


Ser 


Leu 


He 


Gin 


Asn 


Leu 


Gly 


Ser 


Met 


Gly 


Pro 


Met 




5586 






275 










280 










285 










5588 


ggg 


ttt 


gat 


att 


ggg 


age 


ctt 


cct 


tgg 


aaa 


cct 


gat 


ttt 


ggg 


cat 


act 


912 


5589 


Gly Phe 


Asp 


He 


Gly 


Ser 


Leu 


Pro 


Trp 


Lys 


Pro 


Asp 


Phe 


Gly 


His 


Thr 




5590 




290 










295 










300 












5592 


atg 


cac 


aga 


atg 


ggc 


atg 


ctg 


ate 


aag 


cga 


atg 


ate 


gta 


agg 


ctt 


aca 


960 


5593 


Met 


His 


Arg 


Met 


Gly 


Met 


Leu 


He 


Lys 


Arg 


Met 


He 


Val 


Arg 


Leu 


Thr 




5594 


305 










310 










315 










320 




5596 


aga 


atg 


gat 


ctt 


gaa 


gac 


aat 


tgaagagtcg tggaatttgt tccacaaaas 




1011 


E--> 5597 


arg 


met 


asp 


Leu 


glu 


^sp 


asn 
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<210> 55 
<211> 5674 
<212> DNA 
<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: vector 
construct pBluKS-nit P-STLS1-35S-T 

<400> 55 

ccagcttttg ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc 60 

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 120 

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 180 

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 240 

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 300 

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 360 

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 420 

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 480 

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 540 

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 600 

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 660 

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 720 

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 780 

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 840 

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 900 

■tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 960 

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 1020 

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 1080 

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 1140 

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 1200 

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 1260 

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 1320 

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 1380 

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 1440 

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 1500 

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 1560 

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 1620 

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 1680 

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 1740 

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 1800 

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 1860 

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 1920 

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 1980 

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 2040 

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 2100 

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 2160 

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg 2220 

gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg 2280 

ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 2340 

cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 2400 

tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 24 60 

cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 2520 

ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 2580 

tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 2640 

aaatattaac gcttacaatt tccattcgcc attcaggctg cgcaactgtt gggaagggcg 2700 

atcggtgcgg gcctc.ttcgc tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg 27 60 

attaagttgg gtaacgccag ggttttccca gtcacgacgt tgtaaaacga cggccagtga 2820 

attgtaatac gactcactat agggcgaatt ggagctcgtc gagaccagat gttttacact 2880 



tgaccgtaaa tgagcaccS^aagaaaccgg 
aagatgactc aaacaagtaa tcggttgtga 
gtcaagttca aaatgttatg ttgagtttca 
ttcgttaata atggaagaga accaattctc 
aatttgagtg ttcaattctg gatgttgtgt 
tattatgaat ctggctttga tttcagttgt 
catttagaat gtttaataat taagttactg 
acatatatta atttattttt cttaaaaatg 
tagaagctat gctaaaccat tacgtatatg 
tatatatatg gatggctgtc acaatcagaa 
cataaaaaga ctatttcgtt tcattgacaa 
agtggaattt gagttcctgc aagtaagaaa 
tttcttttat ctgaaagctg caatgaaata 
gggtttggtt ttcttgatgc gaactaattg 
attcaaaaat tttgatttct cttgtaggaa 
ccaataatag taaatccaaa gatatttgac 
tttatttaat ccagtttctc tgagataatt 
taagattaaa taggagccaa actcacattt 
aaaatctata cgtcaaattt tatttatgac 
tacagctaag acaaaccaaa aaaaaaatac 
tcaatattat taggtagaat cgtacgacca 
atatctataa catagtatat attattacct 
gatttaagga atccataaaa gaataaattc 
aagtttttta tttgtttttt atttggtagt 
attgtgtcca aatactttgt aaaccaccgt 
ttttatattc agacataaag catcggctgg 
ctcaattatt agtaggtttg atgtgaactt 
agaaaagaaa ccacgtgggc tagtcccacc 
cgttaccgtc tccaatagga aaataaacgt 
tatattgctt ctctcaagtc tctgagtgtc 
ctgcaggcat gcaagcttat cgtcgactac 
tataataatt atcattaatt agtagtaata 
agaatgtagt atatagcaat tgcttttctg 
aacttttcta atatatgacc aaaatttgtt 
tcggtacgct gaaatcacca gtctctctct 
aataatgtgt gagtagtttc ccgataaggg 
tgtgttga'gc atataagaaa c'ccttagtat 
taaaatttct aattcctaaa accaaaatcc 
tatagatctt tgtcgtgaat ataaaccaga 
ccgttcgaag ctagaagtac cgcttaggca 
gggttggtta cgttgactcc cccgtaggtt 
aaggaggaag acaaggaagg ataaggttgc 
ttgatagagg tacgctacta tacttatact 
ctataccccc taataacccc ttatcaattt 
aaattggtat cagagccatg aataggtcta 
caaaatacga aagagttctt aactctaaag 
ccctcacacc ggtgacgggg atcgcgatgg 



Se^lbttSS y°/ 

tcacattcat ttcgaaggtg^^Tgaaagcgg 
ttcgtcagtt catgtcactc ctatgaagga 
aacttttatg ctaaactttt tttctttatt 
ttgtatctaa agattatcca tctatcatcc 
taccctacat tctacaacca tgtagccaat 
gttcttttct tttttttctt tgcatatttg 
tatttccaca tacattagtt ccaagaatat 
ttttggaatg actaatattg acaacgaaaa 
tgacttcaca tgttgttgtt ttacattccc 
acgtgatcga aaaaagacaa acagtgtttg 
tttgtgttta tttgtaaaga aaagtggcaa 
gatgaaataa aagacttgag tgtgtgtttt 
ttcctaccaa gcccgtttga ttattaattg 
gttatataag aaactataca atccatgtta 
tatgatttac tatatgagac tttcttttcg 
cggaccaaaa cacattgatc tattttttag 
cattaaggaa aacttagtat taacccatcc 
caaatattaa ataacataaa atggatttaa 
atttcttatt taaatttata tttaatgaaa 
tttctaagtg gtccaaaaca tcaattccgt 
aaaaaaggta ggttaatacg aattagaaac 
attatgagga atcaaaatgc atcaaatatg 
tacgggaaaa aaaatggaat aaattctttt 
tctccatttt gttttatttc 
tgtaattctt aaacggggtt 
tttaatcaat caatagattt 
tacaaaaaaa acaaaaacaa 
ttgtttcatt tccaccacag 




gaccacaaaa aaaaaacaaa 
atgaaccaaa gtaaaaaaca 
gtaagtttct 
taatatttca 
tagtttataa 
gatgtgcagg 
acaaatctat 

gtatttgt 
agtactaaaa 
cacgagacga 
ggaggccgtt 
tggtttaaat 
aggccctgtg 
atacgctaag 
aagaaataat 
tgaccaaaac 
ataaaagatc 
gtac 



gtttggattt 
ttcacttctt 
tatttttctt 
atcaatgcag 
gttcgatctt 
aaaaagtcta 
aagactcgac 
gcttctacct ttgatatata 
aatatttttt tcaaaataaa 
gtgtgtatat tttaatttat 
tatcaccgga tccatcgaat 
ctctctctat tttctc cata 
tcttj 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 



:ttcgctca 
:aaaata cttctatcaa 
tccagatctc ctaaagtccc 
ctaaacctgg agcccagacg 
agggaaaaga tgctaaggca 



atgatgaagt 
caaggtaaga 
ggaatgcttg 
ccgcataagc 
tcaagaggat 
tttcaagatc 



ggacggaagg 
agatggaaat 
tatttatacc 
ccccgcttaa 
aaaacctcac 
aaaactagtt 



5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5674 




See. erra*- 




Page 



VARIaH^LOCATION SUMMARY DIM: 02/17/2005 

PATENT APPLICATION: US/10/522 , 341 TIME: 12:04:45 





Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.xaw 




Use of n's or Xaa's(NEW RULES); 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> t o <223> i s MANDATORY if m f( t s or Xaa's ar ^ present . 
i n <220> to <2j3> s ection, please explai n location of n or Xaa, and which 
resi due n or'Xaa represe nts . ~ ' ■ 

Seq#:55; N Pos . 5014 
Seq#:57; N Pos . 6697 
Seq#:59; N Pos. 1026 
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2/17/05 



VERIFUPHON SUMMARY 

PATENT APPLICATION: 



US/10/522,341 



TIME: 



Page ^ 



02/17/2005 
12:04:45 



Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.raw 

10 M:270 C: Current Application Number differs, Replaced Current Application Number 

11 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
W: Missing Blank Line separator, <220> field identifier 
W: Missing Blank Line separator, <220> field identifier 
W: Missing Blank Line separator, <220> field identifier 
W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:4 
E: Mandatory Header Field missing, <220> Tag not found for SEQ ID#:4^ 
W: Missing Blank Line separator, <220> field identifier 
W: Missing Blank Line separator, <220> field identifier 
W: Missing Blank Line separator, <400> field identifier 
W: Missing Blank Line separator, <220> field identifier 

W: Missing Blank Line separator, <220> field identifier 



228 
231 
236 
354 
357 
443 
641 
645 
866 
1092 
1199 
1384 
1568 
1713 
1802 
1891 
2014 
2260 
2606 
2834 
3185 
3413 
3642 
3754 
3847 
4039 
4225 
4411 
4597 
4782 
4802 
4807 
4814 



M:283 
M:283 
M:283 
M:258 
M:200 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:336 
M:283 
M:283 
M:283 




283 
283 
283 
283 
252 
283 



4819 M:252 



4826 
4846 
4851 
4857 
4869 
4957 



283 
283 
252 
283 
283 
258 



4957 M:258 
4957 M:341 
4975 M:283 
5087 M:283 
5203 M:258 
5203 M:258 



W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
E: 
W: 
E: 
W: 
W: 
E: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 



Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 



identifier 
identifier 
identifier 
identifier 
field identifier 
field identifier 
identifier 
identifier 
identifier 
identifier 
identifier 
identifier 
identifier 



field 
field 
field 
field 



field 
field 
field 
field 
field 
field 
field 



Invalid Amino Acid Number in Coding Region, SEQ ID: 37 



Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <220> 

Missing Blank Line separator, <22 0> 



field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 



No. of Seq. differs, <211> LENGTH: Input : 27 Found: 21 SEQ: 50 
Missing Blank Line separator, <220> field identifier ^/ 
No. of Seq. differs, <211> LENGTH: Input : 26 Found: 20 SEQ: 51 
Missing Blank Line separator, <220> field identifier 
Missing Blank Line separator, <220> field identifier 
No. of Seq. differs, <211> LENGTH : Input : 2 7 Found:20 SEQ:53 
Missing Blank Line separator, <220> field identifier^ y 
Missing Blank Line separator, <220> field identifier S 
Mandatory Feature missing, <221> Tag not found for SEQ 
Mandatory Feature missing, <222> Tag not found for SEQ 
(46) "n" or "Xaa" used, for SEQ ID#:55 after pos.:4980 
Missing Blank Line separator, <220> field identifier 
Missing Blank Line separator, <220> field identifier 
Mandatory Feature missing, <221> Tag not found for SEQ 
Mandatory Feature missing, <222> Tag not found for SEQ 



ID#:55 
ID#:55 



ID#:57 
ID#:57' 
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verification summary 

PATENT APPLICATION: US/10/522,341 





W?TE: 02/17/2005 
TIME: 12:04:45 



Input Set : A:\sequence listing.txt 
Output Set: N:\CRF4\02172005\J522341.raw 



L:5203 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:57 after pos.:6660 

L:5262 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5510 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5597 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:59 after pos.:1011 

L:5597 M:254 E: No. of Bases conflict, LENGTH : Input : 325 Counted:1032 SEQ:59 

L:5597 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 6 

L:5597 M:112 C: (48) String data converted to lower case, 

L:5597 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 1011 Found: 1032 SEQ: 59 

L:5673 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5761 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5842 M:283 W: Missing Blank Line separator, <220> field identifier 

L:5923 M:283 W: Missing Blank Line separator, <220> field identifier 

L:6003 M:283 W: Missing Blank Line separator, <220> field identifier 

L:6015 M:283 W: Missing Blank Line separator, <220> field identifier 

L:6027 M:283 W: Missing Blank Line separator, <220> field identifier 
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